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Abstract 

Two kinds of evolving trees are considered here: the exponential trees, 
where subsequent nodes are linked to old nodes without any preference, 
and the Barabasi-Albert scale-free networks, where the probability of link- 
ing to a node is proportional to the number of its pre-existing links. In 
both cases, new nodes are linked to m = 1 nodes. Average node-node 
distance d is calculated numerically in evolving trees as dependent on the 
number of nodes A^. The results for A'' not less than a thousand are aver- 
aged over a thousand of growing trees. The results on the mean node-node 
distance d for large N can be approximated by d = 2 ln(A'^) -I- ci for the 
exponential trees, and d — ln(A) + C2 for the scale-free trees, where the d 
are constant. We derive also iterative equations for d and its dispersion 
for the exponential trees. The simulation and the analytical approach give 
the same results. 

Keywords: exponential trees, scale-free trees, evolving networks, small-world 
effect 



1 Introduction 

In recent five years, much attention has been paid to the problem of evolving 
networks El- For our purposes, the problem can be summarized as follows: 
we consider a graph, initially small, of N nodes. A new {N + l)-th node is 
linked to m nodes, selected randomly from N previously existing nodes, with 
probability of linking to a given node dependent on its degree, i.e. its number 
of edges k. If this probability is constant, we get the so-called exponential 
network — this term is due to the exponential distribution P{k) of the number 
of edges of a node 2 . The distribution P{k) is often treated as giving most 
essential local characteristics of the network. Example giving, for the Cayley 
tree P{k) = Sk,3. If the probability of linking to a node is proportional to the 
node degree fc, the network is scale-free, i.e. the distribution P{k) oc fc~^, where 
7 « 3.0. This kind of networks is called "Barabasi-Albert networks" from the 
names of the inventors 0] . The scale-free character has been discovered in many 
real networks: a network of web pages which are available one from another, a 
network of coauthors of scientific papers, a network of actors playing roles in 
the same movies and some others. In any case, the evolving procedure is to be 
considered as a non-equilibrium process. 

Here we focus on the mean distance d between nodes in trees, where m = 1. 
In trees, the path between each two nodes is unique and it cannot be changed 
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Figure 1: Construction of the distance matrix. 



during the system evolution. A number of results on d in trees has been derived 
recently for uncorrelated random networks [HI El IE] • These results are based 
on scaling hypotheses and/or assumptions of lack of correlations between nodes. 
The goal of the present work is to calculate d{N) numerically for the exponential 
trees and the Barabasi-Albert m = 1 scale-free networks, and analytically for 
the exponential trees. Our analytical results are formulated in terms of iterative 
equations and they are exact also for small trees. 

2 Numerical calculations and results 

The numerical algorithm is as follows: an initial tree is composed of two nodes 
connected by an edge. Each time when a (A^+l)-th node is added to a q-th node, 
the distance s{N + between the new node and any other (say, i-th one) is 
1 + s{q,i). The set of distances can be presented as a symmetric matrix s{i,j). 
Adding a new node is represented by an enlargement of the matrix size by one. 
The last {N + l)-th column and row are equal to one plus the g-th column (or 
row), where q is the index of a node to which the new node is attached (see 
Fig.d. 

The mean distance, i.e. the average length d, is just the average over the 
nondiagonal matrix elements. Obviously, the diagonal elements are zero. The 
obtained curve d{N) is an average over Nmn — 10^ trees. 

The results on (d) (hereafter abbreviated as d) for the exponential trees and 
for the scale-free trees they are shown in Fig. El 

For large the obtained curves can be well approximated as linear with 
In(iV). Namely we get d{N) = 1.981n(A^) — 2.69 for the exponential trees and 
d{N) — l.OOln(A^) -1-0.04 for the scale-free trees. Preliminary numerical results 
on the dispersion for both kind of trees suggest that for large A'^, the dispersion 
cr'^{N) = a\ia{N) + b, where a is the same as for the average distance d, i.e. 2.0 
and 1.0 respectively for the exponential and the scale- free trees. 

3 Analytical considerations 

During the tree growth, the size of the matrix s{i, j) of distances between nodes 
increases. Then, the time evolution of a tree is equivalent to the evolution of 
the matrix size N. We consider the evolution of the ensemble average of the 
distance matrix s{i,j). The mean distance d{N) between nodes, averaged over 
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Figure 2: The average distance d for the exponential and the scale- free trees. 



all pairs of nodes and trees tj^ with N nodes can be written as 



1 ^ 



(1) 



where Pit^) is the probability of a tree tj^ of size N, and it is normalized 
to unity within the set of all possible trees with N nodes. Although we are 
interested in d, it is easier to work just with the average matrix element s{N), 
which includes diagonal elements s{i,i) 
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The relation {N — l)d{N) — Ns{N) comes from the obvious fact that the 
diagonal element s{i,i) = 0. The evolution of s{N) is given by 
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N N 

^-P(iw+i) ^ Sij{tN+i) + 2y^ P{tN+i) Si,jv+i(tjv+i) 



(3) 



because the matrix s{i,j) is symmetrical and s{N + 1, N + 1) = 0. In the 
first sum of the latter expression, the average over trees with (A'' + 1) nodes 
is performed over the sum of matrix elements which depend on the subgraph 
tN- That is why in fact P{t]y-^-l) is the boundary distribution P(tjv), and 
the information on the {N + l)-th node is averaged out. In the second sum, 
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P(tjv+i) = P{tN)P{q\tN), where the latter is the conditional probability of 
attaching a new node to a preexisting node indexed by q. On the other hand, 
we have again s{i, N + I) = I + s{i, q). This is the same rule which has been 
helpful in the numerical calculations, described above. Substituting all that to 
Eq. (3), we get 

N 

{N + lfs{N+l)=Y^P{tN) J2 Si,j{tN) + 2N+ 

(4 

+ 2 P{tN) PiQ\tN) Y ''^M- 

t q=l j=l 

The first term on r.h.s. is nothing but N'^s{N); the problem is with the last 
term. For scale-free trees, the probability P{q\tN) is proportional to the degree 
of the node q in the tree tj^, i.e. it is equal to "fk(q), where 7 is a trce-dcpcndcnt 
constant. Substituting it to Eq. (4), we obtain the relation between the average 
distance between nodes of trees with {N + 1) nodes and the average distance 
from a node of degree k to any other node in trees with N nodes. This relation, 
even if natural, seems hard to use further. 

On the contrary, for the exponential trees P{q\tN) = 1/N. This term can 
be extracted out from the sum and we get a compact iterative relation 

{N + lfs{N + 1) = N'^s{N) + 2N + 2Ns{N). (5) 

Substituting d{N) = Ns{N)/{N - 1), we get 
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The same manipulations are possible with the moments of higher order. The 
average of the squared matrix element is 



1 ^ 

Applying it to the exponential trees, we get the relation 

^ ' N{N + l) ^ ' N{N+l) ^ ^ iV+1 ^' 

We start from the only existing tree of = 2; obviously d{2) — 1, e(2) = 1. 
Successive values of d{N), cf'^{N) — e{N) — d^{N) are shown in Fig. |21 together 
with the respective results of the simulation. 

As we see, the agreement is good. For high values of A^, the plots are 
linear with ln(A^^). For 10^ < N < 10^, the obtained coefficients are: d{N) — 
2.01n(A^) - 2.84, aj{N) = 2.01n(A^) - 1.44. 



4 Conclusions 

For both kinds of trees, the terms of d and cr^ leading in TV are proportional 
to In(A^), with apparently the same coefficients (2.0 and 1.0 for the exponential 
trees and the Barabasi-Albert scale- free trees, respectively). This means, that 
in both cases the term in ct^ proportional to In^(A^) cancels. 

The selecting of subsequent nodes to attach new nodes is equivalent to a 
random walk in the space of all trees. At each step, the number of possibilities 
— nodes where a new node can be attached — increases. In principle, the 
number Nmn of trees used in the averaging procedure should increase in time 
to have a good statistics, but such a simulation would be time-consuming. If 
Nrun is much smaller than the maximal size A^ of trees, the averaging is not 
ergodic and numerical errors can increase. 

In our formulation, the weights of particular trees of size (A^ -f- 1) do depend 
on the weights of trees of size A^ from which they are formed. This dependence 
is specific for the process of growing of the trees. In consequence, our results 
differ from those obtained within the equilibrium statistical mechanics, e.g. in 
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